Where Do Se-trees Perform? (part I)
نویسنده
چکیده
As a classiier, a Set Enumeration (SE) tree can be viewed as a generalization of decision trees. We empirically characterize domains in which SE-trees are particularly advantageous relative to decision trees. Speciically, we show that: 1. SE-trees excel in domains in which relatively few examples are available; and 2. SE-trees excel in noisy domains. In noisy domains, we discover that SE-trees perform more consistently (measured by the variance in error) in one part of the spectrum, and less consistently in the other; in the lack of noise, we nd that SE-trees are almost invariably more consistent than their decision tree counterparts. Finally, we develop a simple complexity measure based on a target function's syntactic form, and show that SE-trees enjoy a particular advantage in more complex domains.
منابع مشابه
Mineral Chemistry and metamorphic evolution of the Late Neoproterozoic metabasites of Do-Chah metamorphic - igneous complex (SE Shahrood)
Metapelites of the Do Chah complex (SE Shahrood) are composed of micaschist, garnet micaschist, chloritoid schist and garnet-bearing gneiss. In the highest degree of metamorphism, metapelites have been affected by partial melting, resulting as granitization. A significant part of these rocks, imposed by compressional tectonic regime and show typical evidence of plastic deformation and intensive...
متن کاملCounting the number of spanning trees of graphs
A spanning tree of graph G is a spanning subgraph of G that is a tree. In this paper, we focus our attention on (n,m) graphs, where m = n, n + 1, n + 2, n+3 and n + 4. We also determine some coefficients of the Laplacian characteristic polynomial of fullerene graphs.
متن کاملClassification trees as an alternative to linear discriminant analysis.
Linear discriminant analysis (LDA) is frequently used for classification/prediction problems in physical anthropology, but it is unusual to find examples where researchers consider the statistical limitations and assumptions required for this technique. In these instances, it is difficult to know whether the predictions are reliable. This paper considers a nonparametric alternative to predictiv...
متن کاملCannyFS: Opportunistically Maximizing I/O Throughput Exploiting the Transactional Nature of Batch-Mode Data Processing
We introduce a user mode file system, CannyFS, that hides latency by assuming all I/O operations will succeed. The user mode process will in turn report errors, allowing proper cleanup and a repeated attempt to take place. We demonstrate benefits for the model tasks of extracting archives and removing directory trees in a real-life HPC environment, giving typical reductions in time use of over ...
متن کاملModular Semi-automatic Formal Verification of Critical Systems Software ; Modulaire halfautomatische formele verificatie van kritische systeemsoftware
In the first part of this thesis, we present a case study on successfully verifying the Linux USB BP keyboard driver. Our verification approach is (a) sound, (b) takes into account dynamic memory allocation, complex API rules and concurrency, and (c) is applied on a real kernel driver which was not written with verification in mind. We employ VeriFast, a software verifier based on separation lo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007